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Abstract 

In distributed target tracking in wireless sensor networks (WSN), agreement on the target state is 
usually achieved by the construction and maintenance of a communication path. Such an approach lack 
robustness to failures, and is not applicable to asynchronous networks. Recently, methods have been 
proposed that can solve these problems using consensus algorithms. However, these methods suffer 
from at least one of the following problems: i) they do not use fastest consensus methods, and ii) they 
cannot handle all parametric and nonparametric likelihood functions. In this paper, we propose a general 
framework for target tracking using distributed particle filtering (DPF) based on three asynchronous 
belief consensus (BC) algorithms: standard belief consensus (SBC), broadcast gossip (BG), and belief 
propagation (BP). Since DPF can be also solved (without consensus) by exchanging the observed data, 
we determine under which conditions BC-based methods are preferred. Finally, we perform extensive 
simulations to analyze the performance of these methods. Our main result is that DPF-BG and DPF-SBC 
provide the best performance in terms of root-mean square error, and that DPF-BP provides the best 
performance in terms of disagreement in the network. 
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Index Terms 

Belief consensus, belief propagation, broadcast gossip, distributed target tracking, particle filtering, 
nonparametric methods, wireless sensor networks. 

I. Introduction 

Distributed tracking in wireless sensor networks (WSN) is important tasks for many applica- 
tions in which central unit is not available. For example, in emergency situations, such as fire or 
nuclear disaster, WSN can be deployed to detect these phenomena. Once the phenomena is de- 
tected (e.g., increased temperature, or radioactivity), the sensors start to sense their neighborhood 
and cooperatively track people and assets. As sensors are low-cost devices that may not survive 
during deployment, it is important to achieve tracking in a manner that is fully asynchronous and 
robust to sensors failures, and in such a way that every sensor has the same belief of the target 
location. Moreover, due to the nonlinear relationships and possible non-Gaussian uncertainties, 
a particle filtering (PF) should be applied |TJ, instead of traditional methods based on Kalman 
filtering (KF) (2). 

Most of the methods for PF-based distributed target tracking in WSN are based on the 
construction and maintenance of the communication path. For example, in [|3j, low-power sensors 
pass the parameters of likelihood function to the high-power sensors, which are responsible to 
manage the low-power nodes. In Q, a set of uncorrelated sensor cliques is constructed, in which 
slave nodes have to transmit Gaussian mixture parameters to the master node of the clique. 
Master node performs the tracking, and forward estimates to another clique. In pj, a Markov- 
chain distributed PF is proposed, which does not route the information through the graph during 
tracking. However, it requires that each node knows total number of communication links and 
the number of communication links between each pair of nodes, which can be obtained only 
by aggregating the data before tracking. These, routing-based^ algorithms lack robustness to 
failures and are also not suitable for ad-hoc sensor networks. 

'Authors sometimes use the term "message passing" I5J-I2] for this type of methods. This can be confusing with the standard 
message passing method, belief propagation, which does not belong to this category. 
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To address these problems, several authors have considered using average consensus algo- 
rithms. In [|6j, the global posterior distribution is approximated with a Gaussian mixture, and 
consensus is applied over the local parameters to compute the global parameters. Similarly, [J7J, 
[|8j uses a Gaussian approximation instead of Gaussian mixture. Randomized gossip consensus 
was used in j9j, [ 10 1 for distributed target tracking. Finally, as a benchmark, we also mention 



the non-centralized PF (NCPF) [11], in which each node broadcasts measurements until all the 
nodes have complete set of measurements. Then, each node (acting like a fusion center) performs 
the tracking. Although this method is not scalable, it can be still competitive in some scenarios. 
These state-of-the-art methods suffer from at least one of the following problems: i) they do not 
use the fastest consensus methods, and ii) they cannot handle all parametric and nonparametric 
likelihood functions. 

In this paper, we propose and evaluate a general framework for target tracking using distributed 
particle filtering (DPF) based on three asynchronous belief consensus (BC) algorithms: standard 
belief consensus (SBC), broadcast gossip (BG), and belief propagation (BP). While parametric 
variants of DPF-SBC have been already used, DPF-BG and DPF-BP are, to the best of our 
knowledge, novel for distributed target tracking. We also determine when it is beneficial to use 
proposed DPF methods over NCPF, and analyze the DPF-BP method in loopy graphs. Finally, 
we performed extensive simulations. Our main result is that DPF-BG and DPF-SBC provide the 
best performance in terms of root-mean square error (RMSE), and that DPF-BP provides the 
best performance in terms of disagreement in the network. 

The paper is organized as follows. In the next section, we review centralized target tracking. 
In Section 3, we propose belief consensus algorithms for PF-based distributed target tracking. 
Simulation results are shown in Section 4. Finally, Section 5 provides our conclusions and 
suggestions for future work. 

II. Overview of Centralized Target Tracking 

We consider the problem of tracking a target in WSN. We assume that there is a number of 
static sensor nodes with known positions and one moving target (e.g., a person or vehicle) in 
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some surveillance area. The target may be passive, but the sensors are assumed to periodically 
make observations that depend on the relative position of the target and the sensing node. The 
goal of the WSN is to track the position and velocity of the target. In this section, we describe 
a centralized approach to solve this problem, in which all the measurements are collected by a 
sensor that acts as fusion center. 



A. System Model 

There are N s sensors with known two-dimensional (2D) positions, l n (n = 1, 2, . . . , N) and 
one target with an unknown state x t at time t. The state of the target is defined as x t = 
[%i,t %2,t ii,t %2,t\ T , where xi >t and x 2yt represent 2D position of the target, and ±i it and x 2 ,t the 
2D velocity of the target. The goal of the WSN is to estimate x t at each (discrete) time t. We 
use the following state-space model: 



x t+ i = Ax t + Bu t 

Vn,t = 9n(x t ) + V n j, 



(1) 

(2) 



where u t = [ui jt u 2tt ] T is the process noise due to the variation of the speed, y ni is local 
observation of sensor n at time t, and v nit is its observation noise. The process noise u t can 
be non-Gaussian, but since it is usually hard to measure Q, [12|, we assume a Gaussian 



approximation with sufficiently large variance (e.g., upper bound of real uncertainty), which 
is common choice. The matrices A and B are given by 



.4 



I2 




T 2 
2 12 


, B = 




2 I 2 







(3) 



where T s is the sampling interval, and I 2 and 2 represent the identity and zero 2x2 matrices, 
respectively. We denote by G t the set of the nodes that have a measurement available at time t. 
For the sake of concreteness, we assume that the measurements are distance measurements to 
the target, i.e., for n e G t , 



9n[Xt) 



(4) 
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The measurement noise u n>t is distributed according to p v (-), which is not necessarily Gaussian, 
and typically depends on measurement technique (e.g., acoustic p"3j , RSS p"4] , RF tomography 
JT5| ) and the environment. 

For simplicity, we assume ideal probability of detection for both sensing and communication 
range, but more complex models can be easily incorporated [16]. That means that a sensor 
can detect the target if the distance between them is less than predefined value r, and that two 
sensors can communicate with each other if the distance between them is less than R. Taking 
into account that radio of a node is usually much more powerful than its sensing devices [ fT7] l, 



[18], we assume R> r. 



B. Particle Filtering 

We apply the Bayesian approach for this tracking problem and recursively determine the 
posterior distribution p{x t \y\ : t) given the prior p(xt_i|yi :t _x), dynamic model p(x t \x t -i) defined 
by ([[]), and the likelihood function p(yt\x t ) defined by §2§. We assume that p(x \y ) = p(x ) is 
initially available. The posterior can be found using the prediction and filtering equations [[TJ: 



pW^,) = J pM^pMrn^-i (5) 

p(x t \y 1:t ) oc p(yt\xt)p(x t \yi;t-i). (6) 

Assuming independence among each measurements at time t, the global likelihood function 
p(yt\x t ) can be written as the product of the local likelihoods: 

p(y t \x t ) oc Yl P(yn,t\ x t)- (7) 

neGt 

For notational convenience we will still write p(y n ,t\ x t) f° r n ^ Gt, with the tacit assumption 
that this function is identically equal to 1. 

Since the measurement noise is generally not Gaussian, and the measurement is not a linear 
function of the state, a traditional Kalman filtering (KF) [[TJ, (2j approach can not be used. Instead, 
we apply the particle filter (PF) (TJ, in which the posterior distribution is represented by a set of 
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Algorithm 1 Centralized PF (CPF) (at time t) 
l: for all particles m = 1 : N m do 
2: Draw particle: x[ m ^ ~ p(x t |x| m ]) 
3: Compute weight: wi' = w^2\ ■ p(y t \x^) 
4: end for 

5: Normalize: = w\ ! / X] ^l™' 1 (f° r m = 1 : ^m) 

m 

6: Compute estimates: x t = Yl, w t x t 

m 

7: Resample with replacement from {w^f, x^ > }^1 1 



samples (particles) with associated weights. A well-known solution is the sample-importance- 
resampling (SIR) method, in which the particles are drawn from p(x t \x t -i), then weighted by 
the likelihood function, p(y t \x t ), and finally resampled in order to avoid degeneracy problems 
(i.e., the situation in which all but one particle have negligible weights). More advanced versions 



of PF also exist [19|-[21|, but we focus on SIR since the distributed implementation of most 
PF-based methods is similar. We will refer to PF with SIR as centralized PF (CPF). The CPF 
method is summarized in Alg. [TJ 

This algorithm is run on one of the nodes in the WSN, which serves as fusion center. The main 
drawbacks of the CPF are: i) large energy consumption on the nodes which are in proximity of the 
fusion center, ii) high communication cost in large-scale networks; iii) the posterior distribution 
cannot be accessed from any node in the network; and iv) fusion center has to know the locations, 
observations, and observation models of all the nodes. In the following section we will focus on 
distributed implementations of PF method, which alleviate these problems. 

III. Distributed Target Tracking 

Our goal is to track the target in a distributed, asynchronous way, such that all the nodes have 
a common view of the state of the target. 

A. Distributed Particle Filtering 

For a distributed implementation of the PF, we want to avoid exchanging measurements and 
to have a common set of samples and weights at every time step. If we can guarantee that 
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the samples at time t — 1 are common, and the weights at time t are common, then common 
samples at time t can be achieved by providing all nodes with the same seed for random number 
generation, so as to ensure that their pseudo-random generators are in the same state at all times. 
Ensuring common weights for all nodes can be achieved by means of a belief consensus (BC) 
algorithm. BC formally aims to compute, in a distributed fashion the product of a number of 
functions over the same variable 

N s 

BC(A(x), f 2 (x), . . . , f Ns (x)) = J] f n (x). (8) 

n=l 

However, most BC algorithms are not capable to achieve exact consensus in a finite number 



of iterations (except BP-consensus in tree-like graphs; see Section III-B). As we require exact 
consensus on the weights, we additionally apply max-consensusH [9J, [22 1, 



MC(Mx), f 2 (x), f Ns {x)) = max/ n (x), (9) 

n 

which computes the exact maximum over all arguments using the same asynchronous protocol 
as average consensus in a finite number of iterations (equal to the diameter of the graph). This 
idea has been already used in |9l for gossip-based consensus. The final algorithm is shown 
in Alg. [2} Observe that, in contrast to CPF, the following drawbacks have been removed: i) 
energy consumption is balanced across the network; ii) reduced communication cost in certain 
scenarios (see later in the paper); iii) every node has access to the posterior distribution; and iv) 
no knowledge required of the locations, observations, or observation models of any other node. 
In the next section, we will describe three distinct BC algorithms. 

B. Belief Consensus Algorithms 

Our goal is to approximate the product of the local likelihoods using BC algorithms. Motivated 



by their scalability, asynchronous behavior and robustness to failures [23 1-[25 1, we consider three 



variants of BC: standard BC (SBC) [24], BC based on broadcast gossip (BG) [25 1, and BC based 



2 Min-consensus can be also applied. 
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Algorithm 2 Distributed PF (DPF) (at node n, at time t) 
l: for all particles m = 1 : N m do 
2: Draw particle: x^ ~ p(x nyt \x ( ^\) 
3: Compute weight: w^f = 

w { ™\ ■ BC (p(?/ lit |xl m) ), ...,p(2/iv s ,tkt m) )) 
4: end for 

5: Normalize: wffl = w^ t /Yl w< nt (f° r rn = l : N m ) 

m 

6: Compute estimates: x n> t = X] w i™' >a; i™' > 

m 

7: ^ = MC (w%\ ...,w<Z) (for m = 1 : N m ) 

8: Normalize: = w^'/E^™' (fo r m = 1 : ^m) 

m 

9: Resample with replacement from {^ ,2^ /^=1 



on belief propagation (BP) (23), |(26]|. 



ij Standard BC: Standard BC (SBC) [24| is defined in following iterative form: 



where G n is the set of neighbors of node n, represents current estimate (at iteration i) of 
the global likelihood of the variable x t (in our case, x t G {x t , . . . ,x[ Nm ^}), and e depends on 
maximum node degree in the network (0 < e < l/^ max , where r] max is maximum node degree 
in the network). For convenience, we define the update rate £ (0 < £ < 1), so e = £,/r] mfai . 
Update rate £ « 1 is expected to provide the fastest convergence p2|. Note that logarithm of 



TOJ) represents standard average consensus algorithm [22|. We initialize by 



MW{xt)=p(y nit \x t ). (11) 

This consensus algorithm guarantees convergence (in all connected graphs) as number of itera- 
tions goes to infinity p4| . Thus, it asymptotically converges to the geometrical average of the 
local distributions: 

lim M®(x t ) = I TT p{y njt \xt) J , (12) 

VneGi / 
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from which the desired quantity, rineG t P(Vn,t\ x t)^ can easi ly be found, for any value of x t G 

l X i > • • • 5 X t 1 ■ 

If the maximum node degree (77 max ) and number of nodes (N s ) are not known a priori, we 
need to estimate them in distributed way. The estimation of maximum node degree can be done 



using max-consensus, while N s can be determined |27| by setting the initial state of one node to 



1, and all others to 0. By using average consensus [22], they can obtain the result 1/N S , which 
is the inverse of the number of nodes in the network. 

It is also worth nothing that we can alternatively apply a number of parametric approximations, 
in which we represent beliefs in parametric form (e.g., Gaussian, or Gaussian mixture), and make 
consensus on their parameters. A number of these methods (e.g., [|6)-[[8j) represent specific 
instances of the method described here, which we will name DPF-SBC. 

2) BC based on Broadcast Gossip: Gossip-based algorithms [ |28| can also achieve consensus 
in asynchronous networks. In order to use the broadcast nature of WSN, we choose broadcast 



gossip (BG) [25]. It has been shown [25 1 that this method is significantly faster than other well- 
known gossip-based methods, such as randomized gossip p9j , and geographic gossip [30|, in 
which only one pair of the nodes update its state per iteration. In broadcast gossip, it is assumed 
that all the nodes has internal clock which ticks independently according to a rate of e.g., a 
Poisson process [25]. When the clock of the n-th node ticks, node n broadcasts its own state 
value. This state value is received by all neighbors within communication radius R. Then, these 
nodes will make weighted average of their current state value and the received state value. It 
has been shown $25\ that BG converges, in expectation, to the real average value. 



For the belief consensus, we need to achieve convergence to the geometrical average (12), so 
at the A;-th clock tick of node n all the nodes make the following operation: 

X \xt)i otherwise 



where < 7 < 1 is the mixing parameter. It has been shown [25] that optimal value of 7 depends 
on the algebraic connectivity of the graph (which represent the second smallest eigenvalue of the 
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Laplacian matrix p2|, p5|). However, this parameter is not available in distributed scenario, so 



empirical study has been used [25| to find the optimal value of 7. Therefore, we will model 7 as 
function of average node degree fj in the network, since fj can be easily estimated in distributed 
way. Initialization is exactly the same as for SBC. We also need to apply average consensus to 
estimate N s and fj. We refer to this variant of DPF as DPF-BG. 

We make one difference comparing with standard BG. In order to have the same communica- 
tion cost per iteration, we assume that one SBC iteration corresponds to N s BG iterations (i.e., 
i = \k/N s ~\). This assumption is reasonable taking into account that, to avoid collisions, even 
SBC has to broadcast its data in sequential wayj^] 



3) BC based on Belief Propagation: Belief propagation (BP) [23], [26 1 is well-known message 



passing algorithm on an undirected graphical model. Consider the following function 

Ifp(2/n,t|av) ] [ 6(x nt t - x U)t ), (14) 

which is equal to rineGt P(yn,t\ x t), whenever all the dummy variables are the same. Running 
BP on the corresponding graph yields the marginals M n (x n t ) = C Y[ n p(y n ,t\ x n,t) for every 
n, where C is a normalization constant. Note that this normalization constant is irrelevant as 
weights in Alg. [2] will be normalized later anyway. The BP message passing equations are now 
as follows: the belief at iteration i (the current approximation of C Y[ n p(yn,t\ x n,t)) is given by 



(T6| eq. (8)] 

-^■n\ x n,t) oc 

P(y n ,t\ x n,t) ] [ m£(x„,t), (15) 



while the message from node u e G n to node n is given by [16 eq. (9)] 



m^Kxnj) oc / 0{x njt -x Uit ) — 7— tt dx u>t (16) 



(j-l)f \ 

TTlnu \ x u,t ) 



Mt 1] (x n >) 



TTlnu \ x n,t) 

3 It is also possible to use a faster protocol such as collision broadcast gossip J31I, which is also valid for SBC and BP. 



(17) 
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We note that since all dummy variables are the same, we can write x n i = x u>t = x t . Some 
manipulation yields (see the Appendix) 



(18) 



which represents novel consensus algorithm based on BP. This method is initialized by Mn (x t ) = 

(2) J I 

p(yn,t\ x t)- We also need to set MA (x t ) in order to run the algorithm defined by (18). Using 



<|15[) and (Il6j), and assuming that mS|(j; t ) = 1, we find 



M n\ x t) =p(y n ,t\ x t) n p(yu,t\ x t) 



(19) 



It has been already shown that this algorithm guarantees convergence to C\\ n p(y n ,t\ x t) f° r 



cycle-free network graphs p6[ , p2[ |, p3| . When the network graph has cycles, the beliefs are 
only approximations of the true marginals (more details in Section III-C[ ). Comparing (18) and 
( fTO] ), we can see that SBC is not specific instance of BP. In contrast to SBC, BP-consensus 
agrees on product of all local evidences (not the iV s -th root of the product), and does not rely 
on knowledge of ^ max and N s . We refer to this variant of DPF as DPF-BP. 

Finally, it is important to mention that there is alternative specific instance of BP, called 



consensus propagation [34], in which all single-node and pairwise potentials are assumed to 
be Gaussian. This method can be useful for other applications in which the beliefs can be well 
approximated with Gaussian distributions. In that case, and if there is no significant nonlinearity, 
a distributed Kalman filter (DKF) |35| should be used, instead of DPF. 



C. Convergence behavior in Loopy Graphs 

As already mentioned in the previous section, the convergence behavior of SBC and BG is 



already analyzed in the literature [24|, [25|. However, BP consensus in loopy graphs has not, so 
we provide a deeper analysis in this section. 



It is already well-known [26 1 that BP consensus (as a special case of standard BP) converges 
to the exact solution after a finite number of iterations in cycle-free graphs. Using an appropriate 
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message schedule, this number of iterations is equal to D g + 1, where D g is the diameter of the 
graph (i.e., the maximum hop-distance between any two nodes). However, for general graphs, it 



is straightforward to show (using equation (18)) that the beliefs of BP consensus after D g + 1 
iterations is given by: 

M^ +1 \x t ) oc J] piyuAxtY^ (20) 

u&Gt 

where a UtUjt > 1 is an exponent (a„ in>4 G N) of node pair (u, n) at time t. In case of cycle-free 
graphs a U)n> t = 1, so the estimated belief is equal to desired global likelihood (given by (|7])). In 
case of a Ut t, n > 1, the observation from node u at time t is over-counted at node n. To understand 
the overcounting behavior, we determine a max , the maximum value (maximized over n and u) 
of ct u ,n,t after D g + 1 iterations. Note that running more than D g + 1 iterations is unnecessary, as 
it will increase the a-values. While for the general case this problem is hard, we limit ourselves 
to some best- and the worst-case examples. In particular, we consider 4 representative graph 
configurations, shown in Figure [T] 

1) Fully-connected graph (clique): For the example in Figure [1^, D g = 1, so the belief at 



second iterations is given by (19). Since the graph is fully-connected, we know that set G n 
includes all nodes in the graph except node n (which is locally available). Therefore, « max = 1, 
so BP consensus is correct. 

2) Single-cycle graph with even number of nodes: For the example in Figure |TJd, D g = 2, so 
we need to run 3 iterations of BP. In the second iteration, node 1 will obtain likelihood from 
nodes 2 and 3, but in the third iteration it will obtain likelihood from node 4 twice (through 
nodes 2 and 3). Therefore, a max = 2. 

3) Single-cycle graph with odd number of nodes: For the example in Figure [TJ:, again D g = 2, 
so we need to run 3 iterations of BP. In the second iteration, node 1 will obtain likelihood from 
nodes 2 and 3, and in the third iteration it will obtain likelihood from nodes 4 and 5. Therefore, 
« max = 1, so BP consensus is correct. 

4) Graph with short loops: For the example in Figure [TJi, D g = 3, so we need 4 iterations of 
BP. After 4 iterations, nodes 1 and 6 will have triple-counted their own local likelihoods (since 
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it has its own information, as well as messages received due to the clockwise and counter- 
clockwise circulation through short loopj^] 1-6-7). Therefore, a max = 3. This reasoning can be 
generalized to a case with iV s h short loops (which all contain the edge 1-6), a max = 1 + 2AT sh . 

Taking into account that case 4) is the worst-case scenario, we can conclude that in the worst- 
case ct max = 1 + 2iV s h. This is not a promising conclusion, since a max can be unbounded, for 
fixed D g , as the number of nodes grows. However, with good deployment, highly asymmetrical 
configurations can be avoided. 

D. Communication Cost Analysis 

In this section, we analyze the communication cost of the three DPF methods, and compare 
with the cost of NCPF and CPF. We denote by iV pack the number of packets that a generic node 
n broadcasts at a generic time t. We assume that one packet can contain P scalar values. We 
neglect the cost of determining (r] max , fj and N s ), so that all DPF methods will have the same 
communication cost. 

1) Cost of DPF: At every iteration (except the first), nodes transmit N w weights. In addition, 
nodes must perform max-consensus (MC), which also requires transmission of the weights in 
each iteration. The number of iterations of the BC is N it . The number of iterations of the MC 
is equal to the to the diameter of the graph D g . Thus, the average cost of DPF per node and per 
time slot is 

(D g + N it -1). (21) 

2) Cost of NCPF: NCPF does not require transmission of the weights, but only local data, 
i.e., its observations and its 2D position^] We denote the number of these scalar values as iVdata- 
The amount of data will accumulate with iterations since the node has to transmit its own data 
and all received data. Since the number of iterations is equal to D g , the cost can be approximated 

4 A short loop is defined as a loop that consists of 3 nodes. 

5 If all the sensors learn the measurement model online, learned parameters also have to be transmitted. 



P 
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by: 



jyNCPF 
pack 



D g -1 

E 

k=0 



f] k N, 



data 



P 



(22) 



where we approximate the degree of the each node with average network degree (77). 

3) Cost of CPF: The cost of CPF depends on many factors, including the routing protocol, 
and the position of the fusion center. Taking into account that in CPF each node transmits its 
information once (in contrast of D g times, in NCPF), and that the fusion center is on an edge 
of the area, the average cost can be roughly approximated with 

jyNCPF 



pack 



(23) 



Note that this cost is not evenly distributed over network. 



4) Discussion: From (21) we see that the DPF methods are fully scalable, since increasing 
the number of the nodes (by increasing its density) will not affect the cost. Although beyond 
the scope of this paper, we mention that if one prefers to use parametric approximations (6J- 
(8j instead of N w messages, only parameters of the beliefs and 2D sensor positions should 
be transmitted, in each iteration. It is also possible to transmit only large weights (larger than 
predefined threshold) as in [|9j, or use other techniques for weight compression. 

Making the reasonable assumption that N it — D g + 1, we can quantify when DPF is preferred 
over NCPF, i.e., when < N™™: 



1 »t,-l 



k=0 



y^Vdata 
P 



(24) 



This condition is important in order to avoid over-using of consensus-based methods. For 
example, if the network is fully-connected (D g = 1), or if the packet size is sufficiently large 
to afford transmission of all accumulated data (i.e., P > r/ Dfl ~ 1 A' d ata), NCPF should be applied. 
On the other hand, if the communication radius is very small (i.e., if D g is very large), DPF 
methods should be applied. Note that a similar comparison can be done with CPF (i.e., using 



(21 ) and (23)), but note that the communication cost is not the unique reason why CPF method 
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is not used (see Section jlT-B]). 



IV. Simulation Results 



A. Simulation Setup and Performance Measures 

We assume that there are N s = 25 sensors semi-randomly deployed in a 100m x 100m area: the 
area is divided into N s square-shaped cells, and one sensor is randomly placed in each of them. 
The positions of these sensors are perfectly known. There is also one target in the area which 
is moving with constant speed V = 5m/s according to a Gaussian random walk. The sampling 
interval is set to T s = Is, and number of these intervals is set to N t = 50. We set the sensing 
radius to r = 25m, and vary the communication radius R. We assume that the measured distance 
is distributed according to Gaussian mixture with two components, in which one component is 
outlier. The parameters of this noise are set to following values: /i^ = (lm, 10m), = (lm, lm) 
and w d = (0.75,0.25). We use N p = 200 particles. The results are averaged over N mc = 100 
Monte Carlo runs. 

We will compare CPF, NCPF, and the three DPF methods (DPF-SBC, DPF-BG, and DPF-BP). 
We consider two performance metrics: root-mean-square error (RMSE) in the position error e rms , 
and, for DPF methods, the average disagreement in the position error edi S - Introducing e n ^ s as 
the target positioning error (i.e., Euclidean distance between the true and estimated position of 
the target) at node n, at time t in simulation run s, we have 



B. Performance Results 

Having defined scenario, we need to find reasonable values of the update rate £, and the 
mixing parameter 7. We analyzed e rms and edi S of DPF-BC and DPF-BG w.r.t. these parameters, 
for different values of communication radius. We found that, as expected, £ « 1 consistently 




(25) 



and 




mc 



(26) 



s,t 
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provides the best performance, so the SBC exponent is set to e = 1/ max n (?7 n ). On the other 
hand, the best value of 7 increases with the communication radius. We have modeled 7 as 
function of average node degree in the network fj by: 

7(77) = l-ae^G (0,1), (27) 

where a = 0.49, and b = 0.17 were found by fitting using training data. 

We will now investigate the convergence as a function of the number of iterations, for R = 25m 
and R = 45m. From Figure |2} we draw a number of conclusions. First of all, CPF and NCPF 
provide the best RMSE performance, as they have access to all observations. Among the DPF 
methods, DPF-BG and DPF-SBC provide better RMSE performance than DPF-BP, as the latter 
algorithm is affected by the loops in the factor graph, leading to biased beliefs. On the other 
hand, DPF-BP offers the fastest convergence. This is expected since it is empirically known 



[26 1, [32 1 that BP often converges after a finite number of iterations (in our scenario, usually 



for Ni t ~ D g + 1). In fact, using (18), it is straightforward to see that N it = D g + 1 leads 
to a minimal RMSE, since then all local likelihoods are available at each node. A further 
increase of the number of iterations will only increase the amount of over-counting of the local 
likelihoods, thus leading to biased beliefs. DPF-SBC is consistently the slowest method in terms 
of disagreement, but it is the unique DPF method that guarantees the convergence in terms of 
the both metrics. 

Secondly, we will vary the communication radius R, and fix N it = \L/R\ + 1, as an 
approximation of N it — D g + 1. Here L is the diameter of the deployment area (L = lOOv 7 ^ 
m, in our case). As we can see in Figure [3j DPF-BG and DPF-SBC achieve the best RMSE 
performance, close to the RMSE of CPF/NCPF for large R. On the other hand, DPF-BP performs 
the best in terms of disagreement, for all considered values of R. However, DPF-BP performs 
poorly in terms of RMSE. Note that if we use exactly D g + 1 iterations, the performance of 
DPF-BP will be significantly better]^ Of course, in practice the network will have no knowledge 

6 In our example, according to Figure 3^ and Figures 2 1 and 2^, \L/K\ = D g only for R = 25m. 
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of D g . Finally, we can also see that DPF-SBC provides very good agreement for large R. 

Finally, we will evaluate the communication cost, and analyze the average number of packets 
per node as a function of the communication radius R for A^ t = \L/R~\ + 1. We consider 
networks with 25 and 100 nodes, and packets sizes of P = N w , and P = 5N W , where N w = 
N p = 200, and iVdata = 3. As we can see in Figure [4| DPF-based methods provide nearly 



constant communication cost as a function of R, since (21) only depends linearly on D g . By 
comparing Figures [4^ and |4j5, and Figures |4j: and HH, we can see that for DPF-based methods 
the communication cost does not depend on N s . Thus, these methods are fully scalable. On the 
other hand, the communication cost of CPF/NCPF is highly sensitive to R and N s . It increases 
as R increases (while D g = \L/R~\ is fixed), and decreases significantly when D g decrements 



its value (e.g., for R = 50y/2). Overall, decreasing D g has the largest effect (see (22)), so 
the total cost has decreasing tendency with R. In addition, since the increased N s affects fj, 
the communication cost will be significantly larger. Regarding the effect of P, we can see that 
larger values of P will make CPF/NCPF cheaper, as more data can be aggregated in one packet. 
Finally, comparing with NCPF, we can see that DPF methods have a lower communication cost 
for R < 70m, except when P is very large (as in Figure Hb). 

As an aside (without numerical comparison) we mention that other asynchronous tracking 
methods based on average consensus (6j-|(8} perform worse than DPF-SBC due to the likelihood 
compression (parametric approximation), and that methods based on randomized gossip [|9j, p0 | 
perform significantly slower than DPF-BG. 

V. Conclusion 

We have presented three particle filter methods for distributed tracking in WSN: DPF-SBC, 
DPF-BG, and DPF-BP. In contrast to previous methods, these methods can approximate global 
likelihood function in nonparametric form. According to our results, DPF-BG should be used 
in all tracking applications where minimal expected error is crucial. On the other hand, if the 
agreement of the estimates in the networks is more important than absolute error, DPF-BP could 
be a good choice. DPF-SBC, which guarantees convergence after large number of iterations, 
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could be applied when the cost and latency are not crucial. There remain many research lines 
for the future work. For example, many parametric approximations of DPF-BG and DPF-BP 
methods can be analyzed. In addition, one can consider hybrid methods based on combination 
of SBC, BG and BP, which might provide better a convergence/performance trade-off. Finally, 
it would be interesting to analyze how sensor deployment affects over-counting in DPF-BP. 

Appendix 



From (17), we know that 



hlt l \x t ) 

<l(x t ) oc v ' (28) 

rrVnu '[xt) 

where we removed index n since all the nodes have the same variable (x n>t = x U)t = x t ). The 



denominator of (28) is the message from node n to node u in the previous iteration, and can be 
expressed as 

m^\x t ) oc 4^TT- ( 29 > 
rrv U n [Xt) 

Combining previous two equations, we get the recursive expression for the messages 

m ®M K ^TT m ^ 2) ^) (30) 

MA '{x t ) 



Combining (15) and (30), we find a recursive expression for the beliefs: 



oc p(y n , t \x t )H f^4r4 m - 2) ^) (31) 



u£G„ 



(i-2)/ \ TT / Mu ^(xt) 



p(yn,t\xt) n rnL 2 \xt) n 



Mt 2 \x t ) 
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Fig. 2. Performance comparison of DPF methods as a function of the number of iterations, (a) RMSE, R — 25m, (b) avg. 
disagreement, R — 25m, (c) RMSE, R — 45m, and (d) avg. disagreement, R — 45m. 
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Fig. 3. Performance comparison of DPF and CPF/NCPF as a function of communication radius, of: (a) RMSE, and (b) average 
disagreement. 
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Fig. 4. Communication cost comparison as a function of the communication radius, for: (a) 25-node network, P = N w , (b) 
100-node network , P = iV^,, (c) 25-node network, P — 5N m , and (d) 100-node network, P = 5N W . 
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